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Abstract 

In this note, we discuss the implications of the weak gravity conjecture (WGC) 
for general models of large-held inhation with a large number of axions N. We hrst 
show that, from the bottom-up perspective, such models admit a variety of different 
regimes for the enhancement of the effective axion decay constant, depending on 
the amount of alignment and the number of instanton terms that contribute to the 
scalar potential. This includes regimes of no enhancement, power-law enhancement 
and exponential enhancement with respect to N. As special cases, we recover the 
Pythagorean enhancement of A-flation, the N and enhancements derived by 
Bachlechner, Long and McAllister and the exponential enhancement by Choi, Kim 
and Yun. We then analyze which top-down constraints are put on such models from 
the requirement of consistency with quantum gravity. In particular, the WGC ap¬ 
pears to imply that the enhancement of the effective axion decay constant must not 
grow parametrically with N for N ^ 1. On the other hand, recent works proposed 
that axions might be able to violate this bound under certain circumstances. Our 
general expression for the enhancement allows us to translate this possibility into a 
condition on the number of instantons that couple to the axions. We argue that, at 
large N, models consistent with quantum gravity must either allow super-Planckian 
field excursions or have an enormous, possibly even exponentially large, number of 
dominant instanton terms in the scalar potential. 



1 Introduction 


Even after the recent release of the combined Planck/BICEP2 results [1], the updated 
bound r < 0.12 on the tensor-to-scalar ratio of primordial fluctuations is still well above 
the Lyth bound r < 0.01 [2] such that models of large-held inhation remain viable sce¬ 
narios compatible with observations. In such models, the inhaton undergoes a large held 
excursion in Planck units during inhation and, hence, operators of all mass dimensions 
are relevant in the inhaton potential. Models of large-held inhation are thus sensitive to 
quantum gravity ehects, suggesting that they can consistently be studied only in a string 
theory framework (see [3] for a review of inhation in string theory). 

In this context, axions are particularly well-motivated inhaton candidates since they 
enjoy a continuous shift symmetry which protects the inhaton potential from perturbative 
quantum corrections [4]. The shift symmetry is broken non-perturbatively to a discrete 
subgroup such that the only contributions to the inhaton potential are (multi-)instanton 
terms. Moreover, axions arise naturally in string compactihcations, e.g., from the dimen¬ 
sional reduction of Abelian p-form helds, where the shift symmetry is a 4D remnant of 
the higher-dimensional gauge symmetry. 

Unfortunately though, it turns out that it is difficult to realize explicit models of 
axion inhation with a large axion decay constant in string theory. For models with a 
single axion, it was shown in [5] that it is impossible to obtain an axion decay constant 
larger than the Planck mass in many diherent examples. In fact, a general quantum 
gravity argument—the weak gravity conjecture (WGC) [6]— suggests that this is not a 
coincidence but due to a general principle in string theory. According to the WGC, 
held excursions parametrically larger than the Planck mass may simply not be allowed 
in EFTs consistent with quantum gravity (see also [7]). The conjecture was originally 
motivated using the example of an Abelian gauge held but argued to apply more generally 
to systems involving p-form helds. An extension of the conjecture was analyzed in detail 
for the case of multiple U(l)’s in [8]. Furthermore, a number of recent works studied the 
implications of the WGC for multi-axion systems [9-16]. 

In [12], it was shown that contributions to the scalar potential from gravitational 
instantons forbid a super-Planckian held range, conhrming that the WGC indeed bounds 
the held excursion in models with one or several axions. In [13] , the problem was analyzed 
from a diherent point of view. There, the authors found a bound on the axion held range 
by relating the setup to the more tractable case of U(l)’s using T-dualities (see also [11] for 
similar arguments). Both [12] and [13] also discussed a possible loophole by which models 
with one or several axions might be able to evade a bound on the held excursion. In [14], 
the authors advocated the same loophole and argued that gravitational instantons do in 
general not spoil trans-Planckian held excursions in models with many axions. However, 
[13, 16] also gave counter-arguments suggesting that the loophole is unlikely to be realized 
in string theory. On the other hand, string theory constructions possibly evading a bound 
on the held excursion were discussed in [17, 15]. It is therefore fair to say that it is still 
under debate whether or not the WGC implies a strict bound on the axion held range in 
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general models of axion inflation. 

On the other hand, it is well-known that generic “bottom-up” models of axion inflation 
do allow for a parametric enhancement of the held excursion.^ One way to achieve this is 
to introduce monodromies for the axions by switching on couplings to branes or huxes. 
Roughly, this changes the topology of the axion moduli space from a circle into a spiral, 
thus allowing for an enhancement of the naive held range [18-22]. In the absence of 
monodromies, an enhancement of the axion decay constant can be achieved by alignment 
[23] in models with at least N = 2 axions, albeit at the cost of having to tune the 
anomaly coefficients entering the instanton potential to rather large values (see, however, 
[24, 25] for a recently proposed alternative without this requirement). If one allows the 
number of axions to be large, 3> 1, the possibilities for enhancement are richer. Even 
without any alignment, one then hnds a Pythagorean enhancement fes ~ VN via the N- 
hation mechanism [26].^ In [17], a statistical analysis of string theory inspired multi-axion 
systems revealed that some amount of alignment is actually generic at large N, leading 
to a stronger scaling f^s ^ N or even ~ Finally, it was found in [28] that a 

moderate tuning of the anomaly coefficients at large N can even lead to an exponentially 
strong enhancement f^g ~ \fN~\n^, with n an 0{1) number.^ 

In view of these results, there are two main questions we want to address in this pa¬ 
per. First, we are interested in how the enhancement of the effective axion decay constant 
scales with in a completely general bottom-up model of multi-axion inflation (statis¬ 
tically likely in the string landscape or not) with an arbitrary number P of instantons 
contributing to the scalar potential and an arbitrary amount of alignment. To this end, 
we derive an analytic expression for f^g in terms of a recurrence relation and determine 
its scaling with N depending on the dihedral angles and distances of the facets in the 
A^-polytope which bounds the fundamental domain of the axion moduli space. These pa¬ 
rameters then in turn depend on the number of instantons and the alignment. We thus 
End a variety of different regimes in which f^g obeys different scaling laws with respect 
to N, including regimes of no enhancement, power-law enhancement and exponential en¬ 
hancement. As special cases, we recover the a//V enhancement of A^-flation, the N and 
regimes found in [17] and the exponential enhancement found in [28]. Our result 
is useful in that it puts into perspective the different large-A^ behaviors found in these 
works. Specihcally, we will show that, at large N, the regimes of power-law and expo¬ 
nential enhancement are only separated by an inhnitesimal change in the dihedral angles 
of order 0{1/N). Moreover, our general algorithm for /gfj opens up the possibility of a 
broad study of the phenomenology of multi-axion models with arbitrary scalar potential. 

Second, we want to address the question how the enhancement mechanisms discussed 
above are constrained by quantum gravity. In particular, we are interested in the con¬ 
ditions which a general bottom-up model would have to satisfy in order to be consistent 

^ By “bottom-up”, we mean both phenomenological and string theory inspired models which may be 
oblivious to possible constraints from the WGC. 

^ Possible embeddings of fV-flation into type IIB string theory were discussed in [27]. 

^ See also [29-37] for other recent works on axion inflation. 
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with a possible bound on the held excursion. To what extent arguments such as those in 
[13] also hold for models with monodromies is not sufficiently understood so far. Never¬ 
theless, an obstruction to an inhnite held excursion is that tunneling processes become 
probable when the potential energy of the axion becomes too large [38]. In order for 
these ehects to be suppressed, the axion should not start to roll down the potential too 
far away from the minimum, which translates into an upper bound for the held excursion.^ 
In this paper, we study the held excursion in models without monodromies. Our general 
recurrence relation allows us to translate this problem into a bound on the number of 
dominant instanton terms that contribute to the scalar potential.^ Our main result is 
that, in order for the enhancement to converge to a hnite value at large N, the number 
of dominant instantons would have to grow faster than quadratically and possibly even 
exponentially with N. For the most part, the arguments leading to this conclusion are 
purely geometric and therefore model-independent. 

There are two possible interpretations of this result. If the WGC (or other quantum 
gravity constraints) bound the axion held excursion to be sub-Planckian, this implies that 
models of large-held inhation with multiple axions lie in the swampland because their as¬ 
sumptions on the form of the scalar potential become inconsistent for sufficiently large N. 
However, as we will detail in Section 4, it would then be difficult to evade the conclusion 
that string compactihcations with many axions must admit an enormous number of dom¬ 
inant instantons, which is not expected in a perturbatively controlled regime. Assuming 
that perturbative models with many axions exist, a second interpretation of our result 
may therefore be more reasonable: it should be taken as evidence that models with many 
axions must be able to violate a possible bound on the held excursion. Our result would 
thus lend further credence to the point of view that a parametric enhancement of the 
ehective axion decay constant is compatible with quantum gravity. We stress that this 
argument is independent of and complementary to the loopholes discussed in [12-14]. 

This paper is organized as follows. In Section 2, we review the WGC and its possible 
implications for the held excursion in models of inhation with axions. In Section 3, 
we derive the enhancement of the ehective axion decay constant for a general model 
with N axions in terms of a recurrence relation, which depends on the dihedral angles 
and distances of the facets in the Wpolytope bounding the axion moduli space. In 
Section 4, we relate these parameters to the number of dominant instanton terms in the 
scalar potential and argue that it must grow faster than quadratically and possibly even 
exponentially with N in order for the enhancement to converge at large N. We conclude 
in Section 5 with a discussion of our results. 

^ Another bottleneck is to actually construct an explicit string compactification with stabilized moduli 
which realizes an EFT with the desired properties [39, 40]. 

® We refer to instantons as “dominant” if they are relevant for the scalar potential and bound the 
field excursion along the direction in field space to which they couple. This is to be contrasted with 
subleading (multi-)instantons whose contribution to the scalar potential is negligible. 
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2 The weak gravity conjecture 


The WGC postulates that gravity must be the weakest force in any consistent 4D low- 
energy EFT [6]. More precisely, considering an EFT containing gravity and a U(l) gauge 
held with coupling constant g, there must exist at least one charged particle with mass 




( 2 . 1 ) 


The strong version of the WGC holds if the particle satisfying this bound is also the 
lightest charged particle in the theory, while otherwise (2.1) is referred to as the weak or 
mild version of the WGC. 

The above bound tells us that, whenever we make the gauge coupling g too small, 
new particles will become light and destroy the validity of the EFT we started with. 
This can be seen by applying the above bound also to magnetic monopoles with charge 
1/g. Demanding that their mass should at least be of the order A/g‘^ then yields a 
bound A < gMp for the cutoff of the EFT. The strength of the gauge interaction can 
therefore not be made arbitrarily small or, in other words, gravity is the weakest force. 
The conjecture is motivated by black hole arguments involving remnants and a related 
argument that string theory does not allow global symmetries. In the limit g ^ 0, the 
gauge symmetry would become a global symmetry. This should not be allowed, which is 
reflected by the fact that the scale beyond which new physics corrects the 4D EFT then 
goes to zero, A —0. 

A number of related conjectures for theories consistent with quantum gravity was 
formulated in [41]. In particular, it was argued there that the moduli space of a consistent 
theory must have a hnite volume and that travelling an inhnite distance along a geodesic 
in held space would be accompanied by the appearance of an inhnite tower of extra light 
states, thus leading to a break-down of the theory. A simple example illustrating this 
behavior is the volume of a string compactihcation. If it is taken too large, KK modes 
become light, while, if it is made too small, string states become light and we would have 
to revert to the T-dual description instead. 

It was also argued in [6] that the WGG can be generalized to p-form helds and (p — 1)- 
dimensional objects charged under them. Since an axion has p = 0, the natural objects 
to consider in this case are instantons, where the role of the gauge coupling is played by 
the inverse axion decay constant 1//. Instantons correct the scalar potential of an axion 
by terms of the form 


1/(0) ~ e-^E 


1 — cos 



( 2 . 2 ) 


where the suppression of the correction is controlled by the size of the Euclidean action 
Se- Hence, S'e is the analogue to the mass m in (2.1), which determines whether the 
charged particle is heavy enough to be integrated out in the low-energy EFT. A natural 
guess for a generalization of the bound (2.1) to axions is therefore 


5r< 


/ ’ 


(2.3) 
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Figure 1: The fundamental domain of the moduli space of two axions for iV-flation, 
“naive” alignment and alignment including quantum gravity constraints. iV-flation yields 
an effective axion decay constant fes enhanced by a factor of a/2, which is compatible 
with the WGC as long as fes is not super-Planckian. From the bottom-up perspective, 
alignment can lead to a very large enhancement. However, unless axions can exploit a 
loophole, the WGC predicts that new terms then become relevant in the scalar potential 
and shorten the held range. 

i.e., for any axion with axion decay constant /, at least one instanton should exist sat¬ 
isfying the above bound. This implies that, whenever the axion decay constant is taken 
larger than the Planck mass, we expect large corrections to the scalar potential (2.2) 
from instantons satisfying (2.3) and their higher harmonics, spoiling a super-Planckian 
held range (see Fig. 1). Indeed, such large instanton corrections were the main obstacle 
in [5] to constructing large-held models with a single axion. That (2.3) is the correct 
way of generalizing the WGC to axions was recently also shown in [12] via an explicit 
computation of gravitational instantons and in [13] by relating axions to U(l) gauge helds 
using T-dualities. 

A possible loophole to the bound (2.3) was proposed in [13], where it was argued that 
the instantons satisfying the bound might not be those with the smallest action (which is 
admissible if only the weak version of the WGC holds). These instantons could therefore 
be suppressed, while other instantons, which do not satisfy the WGC bound, would give 
the dominant contributions, thus allowing for a super-Planckian held excursion. The 
same proposal was also advocated in [14]. In a similar spirit, it was argued in [12] that 
the presence of a discrete gauge symmetry can forbid the existence of some instantons 
in models with N axions such that the true bound on S'e can be larger than (2.3) along 
some directions in held space. In [14], it was furthermore argued that such a larger bound 
is even generic at large N. The true bound would then be 

5e < (2.4) 

with a model-dependent exponent A > 0. If this is correct, it should be possible to 
engineer models with an ehective axion decay constant fes ~ N^Mp along some diagonal 
in the axion held space and, hence, obtain super-Planckian held excursions. However, it 
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was argued in [13, 16] that it is unlikely that this loophole can be realized in a consistent 
string theory model. 

Another proposed loophole [10] is that, in certain models, instanton corrections may 
come with extra suppression factors such that they can be suppressed in the scalar poten¬ 
tial even if their Euclidean action is small. In such models, one might be able to satisfy 
the strong WGC and still obtain super-Planckian held ranges. However, an explicit real¬ 
ization of this loophole has not been obtained so far. 

To summarize, it has not been fully understood so far whether or not super-Planckian 
held excursions are allowed by the WGC. Given that future observations might reveal a 
large tensor-to-scalar ratio, it is of course crucial to settle this issue. It should therefore 
be worthwhile to also analyze the problem from a diherent point of view, as we will do 
in the following sections. 


3 The effective axion decay constant 


In this section, we compute the enhancement of the ehective axion decay constant for a 
completely general model with N axions and P instanton terms in the potential. The 
Lagrangian of such a model is of the general form 




N 




i=l 


j=l 



(3.1) 


where Aj, Cij are numerical constants and we have chosen a held basis in which the axions 
are canonically normalized. The periodicity of the instanton potential implies that the 
fundamental domain of the V-dimensional axion moduli space is an V-polytope with 2P 
facets [17]. For N = 2, this would be a polygon with 2P edges, for V = 3, it would be 
a polyhedron with 2P faces, etc. Our goal in this section is to study the enhancement 
in the direction of one vertex of such a general polytope. In Section 3.1, we will hrst 
derive this for a simplihed model in which the diherent facets intersecting at the vertex 
have equal dihedral angles and distances from the origin. We will see that this simple 
toy model already gives us a good qualitative understanding of the scaling behavior of 
the enhancement at large N. In Section 3.2, we will then generalize our results to a 
completely general polytope. 

Our approach is somewhat diherent from the one employed in [17], which also studied 
enhancement in models of the form (3.1). First, we will not use landscape statistics in 
this section but compute the held excursion for an arbitrary polytope, statistically likely 
in the string landscape or not. Second, we will determine the enhancement in terms of 
dihedral angles. We will see that the scaling behavior of the enhancement at large N is 
largely determined by the expectation value of the dihedral angles at the corresponding 
vertex and tends to be further increased when variations are introduced. Furthermore, 
our approach will allow us to relate the axion held range to the number of instantons P 
in Section 4. 
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N = 2 


N = 3 


N = 4 


Figure 2: Normal vectors and the angles between them in a simple model for N = 2, 
N = 3 and = 4. 


3.1 A simple model 

Let us now analyze the enhancement in the direction of a particular vertex for which the 
N {N — l)-facets intersecting at the vertex all have the same dihedral angles and the 
same distances from the origin.® We denote the normal vectors pointing from the origin 
to the facets by with i = 1,..., iV. By assumption, all have the same length 
fi < Mp. Let us also assume that the angle between each pair of the normal vectors 
is the same and denote this angle by a 2 - By slight abuse of terminology, we will refer 
to Q !2 as the dihedral angle in this paper. ‘ Let us furthermore denote vectors normal to 
{N — ? 7 ,)-facets by /„ and their lengths by /„. Consequently, the vector pointing towards 
the vertex is denoted by /^r and its length by /tv.® As an example, consider the case 
P = N, where, in the absence of alignment, the fundamental domain is an Wcube. The 
normal vectors are then orthogonal to each other such that 02 = f, and the enhancement 
of the effective axion decay constant grows like /w/Zi = VN- Our goal is now to work 
out the enhancement for general angles 02 7 ^ f- 

® In principle, more than N facets can intersect at a vertex of an N-polytope. This is ungeneric and 
does not lead to further enhancement compared to the generic case of N facets. 

^ Actually, the dihedral angle is tt — 02 . 

® It is conventional in the literature to dehne the axion decay constant / as ^ times the length between 
two maxima of the cosine potential, which would correspond to twice the distance between a facet 
and the origin. Deviating from this convention saves us factors of tt in numerous expressions. For 
the main quantity of interest, the enhancement /at/Zi, our convention has no effect. Also note that 
the definition of an axion decay constant is somewhat ambiguous in models with more than one axion 
[17, 14]. In this paper, the term “axion decay constant” always refers to distances of facets from the 
origin, while “effective axion decay constant” or “enhancement of the axion decay constant” refers to 
distances of vertices from the origin. 
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We start with the case N 
have two normal vectors 
is then given by 


= 2. Then, at each vertex, two facets intersect such that we 
and fi . We can read off from Fig. 2 that the enhancement 


/2 


/l 

COS {\a2) 


(3.2) 


For N = 3, we have three normal vectors, all with relative angles Q! 2 - We can now 
compute /s recursively from /2 and /i as follows. For convenience, we choose our co¬ 
ordinate system such that fl^\ and ^ lie in the (x,^)-plane (see Fig. 2). We can 
then compute scalar products between the different vectors in order to determine the 
components of /{ ^ in terms of a 2 - This in turn allows us to compute the angle between 
and which we denote by 03 . We find 


03 = arccos 


2 cos^ (|« 2 ) - 1 
cos (|a 2 ) 


(3.3) 


Furthermore, one can verify that lies in the plane spanned by Tand This follows 
from symmetry (see again Fig. 2): flipping the polyhedron along the (T,/ 2 )-plane has 
the effect that and fl^'^ are exchanged. However, since the angle between and 
and the angle between and are equal, must be invariant under the flip. 
Hence, it lies in the {z, / 2 )-plane and its length along this plane is given by its absolute 
value /i. This implies /s = /i/cos/3, which together with fs = / 2 /cos(q !3 — /3) yields 
/s = /i/cos (as — arccos (^))- The last equation can now be solved for /s, with the 
result 


V ff + /I - 2 cos(a3)/i^ 

sin(a3) 


(3.4) 


It is straightforward to continue this iterative process for higher n = 4,..., iV. The 
general recurrence relation is then 


fn 


y/fi + /n-i - 2 cos(a„)/i/„_i 
sin(a„) 


(3.5) 


where the angle is given by 


an = arccos 



/f 1 - cos(a2) 

fn-i tan(an-i) 


+ 


A 

fn-] 


cos(a2) 


(3.6) 


Let us now analyze in detail the different regimes of enhancement as we vary the 
dihedral angle a 2 . We hrst check what happens when we deviate inhnitesimally from the 
Wcube case, i.e., we consider a 2 = | -l- ae. Computing a„ for the hrst few orders, we 
then hnd 


Un 


71 

2 


-|- \/n — lae. 


(3.7) 


In the limit e —)■ 0, we must recover the Pythagorean enhancement. We can therefore 
make the ansatz /„ = y/nfi -f Cnfiae for some coefficient c„. Using these expressions in 
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(3.5) and expanding in powers of e, we find 


c 


n 



(3.8) 


For large n, this becomes c„ ~ \/n + c„_i and, hence, cjv ~ This implies that 

onr expansion of /„ already breaks down for e > 0{1/N). Hence, we expect to leave 
the regime of Pythagorean enhancement for all a 2 which deviate from | by at least this 
magnitnde. For 02 < |, the enhancement will be slower than the law, while it will 
be faster for 02 > |. 

Moreover, there is in fact an npper bonnd for 02 at which the enhancement diverges. 
That snch a bonnd exists is easy to see in two dimensions. There, a vertex is determined 
by two normal vectors, which can at most differ by an angle of 180°, i.e., Q !2 < tt. When 
the ineqnality is almost satnrated, the two facets are almost parallel, and the alignment 
becomes inhnitely large. In three dimensions, three normal vectors that each differ by the 
same angle can at most differ by 120°. By compnting scalar products of N unit vectors 
in N dimensions and demanding that they are all equal to cos(a 2 ); we hnd the general 
rule, 

0(2 < TT — arccos . (3.9) 

For large N, this becomes 

«2 < ^ + ^. (3.10) 

Near this angle, the enhancement diverges, while we approach the Pythagorean regime 
for Q !2 —^ f • 

The different regimes of enhancement can be observed in Fig. 3, where we plotted the 
effective axion decay constant for different angles Q !2 at fixed N. The plot shows that the 
enhancement slows down compared to the a//V law when 02 is smaller than | by at least 
an 0{1/N) number and dies out completely as soon as 02 is sufficiently far away from |. 
If, on the other hand, 02 is larger than | by 0{1/N), the enhancement becomes huge and 
diverges. Note that this also implies that, holding 02 7 ^ | fixed and increasing N, one 
is bound to leave the Pythagorean regime in either of the two directions for sufficiently 
large N. This is plotted in Figs. 4 and 5, where one can nicely observe the different 
scaling laws with respect to N as one scans through the different enhancement regimes. 

To summarize, we have seen that, for large N, infinitesimal deviations |q !2 — || 
0{1/N) are sufficient to move away from the regime of Pythagorean enhancement, either 
to a regime of smaller enhancement or to one of an extremely large enhancement. Which 
one of the two possibilities applies depends on the angle 02 on the vertex in question, 
which is determined by two competing effects: 


• Increasing the number of dominant instantons P: this decreases the angles on some 
or all vertices of the polytope. 


• Alignment: this increases the angles on some vertices and decreases it on others. 
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Figure 3: The enhancement of the effective axion decay constant for N = 10^ and N = 10^ 
and different choices for the dihedral angle 012 . The enhancement is slow for angles 
012 < I and diverges for angles 012 > f, where the different regimes are separated by 
Aa 2 ~ 0(1/N). The red dots denote the A^-flation case 012 = |. 
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80 - 



Figure 4: The enhancement of the effective axion decay constant for hxed angle 02 = 
I + ^ and different N. For small N <C 200, we recover the Pythagorean \/N law (blue) 
since the deviation of 02 from | is smaller than 1/N. For larger N, the enhancement 
starts to grow linearly (green). Close to the limit N = 200 where 02 — f = 1/-^, 
the enhancement grows polynomially and later (super-)exponentially (violet). Note that 
there is nothing special to the value N = 200: had we chosen a smaller (larger) angle, 
the divergence would have appeared at larger (smaller) N. 


12 











3 - 

In 

fi - 


20 40 60 80 100 120 140 160 180 200 

N 

Figure 5: The enhancement of the effective axion decay constant for fixed angle 02 = 
I — ^ and different N. For small N <C 20, we recover the Pythagorean \/N law (blue), 
while for larger N the enhancement grows more slowly and finally dies out completely 
(violet). 

Hence, whether there is large enhancement on a particular vertex depends on the interplay 
between these two effects. If we add a lot of terms to the scalar potential such that P ^ N 
but only admit little or no alignment, we will have 02 < f and little or no enhancement. 
If, on the other hand, we take P to be equal to or slightly larger than N and at the same 
time allow strong alignment, we will get 02 > f and the enhancement will be huge.® Of 
course, without making quantitative the relation between the dihedral angles and physical 
quantities such as the number of instanton terms or the anomaly coefficients, the above 
results are just geometry and do not immediately improve our understanding of actual 
string theory models of inflation. We will study the relation between the enhancement 
and the instantons in more detail in Section 4. 

3.2 General polytopes 

We are now ready to discuss the enhancement on a vertex of a completely general poly¬ 
tope, where the N facets intersecting at the vertex can have different distances from 
the origin and different dihedral angles (with i,j = l,...,N and i < j). Analogous 
to what we did above for the simplihed case with a single angle Q! 2 , the enhancement can 
again be derived iteratively by means of a recurrence relation for /„. Since the expressions 

® This seems to be at odds with the results of [17], where a large enhancement was found for large 
N independent of the number of instantons P. This discrepancy is presumably due to the fact that 
the claims of [17] are of a statistical nature. Hence, their result should be interpreted such that it is 
statistically likely in the string landscape that the effect of alignment dominates over the effect of an 
increased number of constraints. This does not contradict our above statement that a regime of no 
enhancement may be reached for sufficiently large P and sufficiently small alignment. 
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Figure 6: The enhancement /tv/Zi for (012) = 0.58 > | (upper plot) and (012) = 0.56 < 
(lower plot). Each plot for cr(Q; 2 ) = 0 (blue dots) and hve realizations of a uniform 
distribution with cT(a 2 ) = 0.029 (upper plot) or (j(a 2 ) = 0.058 (lower plot). 
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Figure 7: The enhancement /Ar/(/i) for ( 02 ) = 0.58 > | (upper plot) and ( 02 ) = 0.56 < 
(lower plot). Each plot for ct(/i) = 0 (blue dots) and hve realizations of a uniform 
distribution with ct(/i) = 0.35. 
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are quite lengthy, we refrain from explicitly writing them down here and only discuss the 
results of our analysis. A detailed explanation of the iteration procedure is given in Ap¬ 
pendix A. Let us only note here that the computation is in principle straightforward and 
can easily be done also for large N using computer algebra. 

In principle, our algorithm allows to compute the enhancement of the axion decay 
constant for any multi-axion model with arbitrary form of the scalar potential. We leave 
a detailed study of the phenomenology of such models for future work and only work out 
some generic characteristic features here. 

In Fig. 6, we have plotted the enhancement Jn/Ji for different N while keeping ( 02 ) 
hxed. Here, 

^ ^ j 

denotes the expectation value of on the vertex in question. Note that this is in 
general different from the expectation value of all dihedral angles in the polytope. In 
the plot, we considered different realizations of uniformly distributed with standard 
deviation 

cr(a2) = \/(^22) - (q!2)^- (3.12) 

An interesting observation is that variations in the angles tend to increase the enhance¬ 
ment at large N compared to the case a = 0 of the previous section. More precisely, 
let us dehne P(^/> l) as the empirical probability that the enhancement for 
given N and ( 02 ) is larger for a ^ 0 than the corresponding enhancement for a = 0. We 
have explicitly verihed that > l) —)■ 1 as a is increased by testing our 

recurrence relation for different distributions of (see Table 

Another interesting observation is that the qualitative behavior of the enhancement 
is well-described already by the simple a = 0 model: as one increases N while keeping 
( 0 : 2 ) hxed, one passes through different regimes of enhancement with different scaling 
laws until one reaches an upper bound for which the enhancement diverges. This can 
again be understood geometrically: when the dihedral angles become large enough, the 
different facets intersecting at the vertex become almost parallel and the distance between 
the vertex and the origin becomes inhnite. Furthermore, the observation that we scan 
through the different enhancement regimes when N is increased and ( 0 : 2 ) is kept hxed 
also implies that the separation of the regimes in terms of ( 0 : 2 ) must go to zero for large 
N. In other words, the larger N becomes, the less we have to adjust ( 0 : 2 ) in order to 
move from one regime to another. This is again analogous to the cr = 0 case, where we 
found that the different regimes are separated by shifts in 0:2 of order 0{1/N). 

In addition to variations in the angles, we also investigated the effect of variations 
in the distances fi^ (see Fig. 7). Compared to angle variations, this effect turned out 

We stress again that this should not be confused with the landscape statistics of [17]. Here, we are not 
considering any string theory input on the likely shape of the polytope. We merely test our general 
recurrence relation for different parameter choices and then present the results in terms of statistical 
quantities. 
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a{a2) 7 ^ 0 



a = 0.007 

a = 0.029 

a = 0.058 

a = 0.029 

a = 0.35 

a = 0.52 

(« 2 ) = 1.58 

0.52 

0.88 

0.98 

0.60 

0.68 

1.00 

( 02 ) = 1.56 

0.50 

0.86 

1.00 

0.84 

1.00 

1.00 


Table 1: > l) at iV = 100 for two different values of ( 02 ), one larger 

than I and one smaller, and for two different variation patterns, one with cT(a 2 ) 7 ^ 0 and 
one with ct(/i) 7 ^ 0. As cr is increased, P(^/> l) —)■ 1 in all cases. Each case 
was tested for 50 realizations. 


to be smaller even for relatively large o'(/i). However, we found again that variations 
in /i tend to increase the enhancement instead of decreasing it, in the statistical sense 
discussed above (see Table 1). 

4 Quantum gravity constraints 

As we have discussed above, general bottom-up models with multiple axions allow a 
parametric enhancement of the effective axion decay constant for a wide range of the 
parameters. For fes ^ Afp, there is no reason to suspect that anything goes wrong 
with such an enhancement, while for larger /eg it is possible that string theory forbids 
a further enhancement due to the WGC [ 6 , 9, 11-13] (or, more generally, due to other 
possible quantum gravity constraints). If this is true, we should only expect to obtain 
scalings like those found above for moderately large N, whereas the enhancement should 
start to converge to a hnite value beyond a certain threshold number Nq. In this section, 
we will not assess the validity of the different arguments for and against a violation of a 
possible bound on the held excursion. Instead, we will try to answer the question how a 
convergence could be ensured in a given model descending from a string compactihcation 
under the assumption that the WGC forbids a super-Planckian held excursion for axions. 

4.1 General considerations 

Let us consider a string compactihcation with N > Nq axions, where Nq is a threshold 
beyond which the parametric enhancement of the ehective axion decay constant is stopped 
by virtue of the WGC. The following possibilities then come to mind: 

• The maximally allowed value for the individual decay constants of each axion de¬ 
creases for large N in such a way that the naive scaling of the ehective axion decay 
constant along some diagonal is cancelled. 

If there are more than N terms in the scalar potential, it is not clear how to assign a decay constant to 
each axion. As noted above, a more useful definition is then to denote by “individual decay constants” 
the distances of the facets of the P-polytope to the origin. 
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• The renormalization of the 4D Planck mass becomes relevant and scales with N at 
least as strongly as the naive enhancement. 

• The number N is bounded from above, i.e., string theory forbids the existence of 
compactihcations in which the number of axions exceeds Nq. 

• Additional instanton terms in the potential bound the fundamental domain to stop 
the enhancement when N grows too large, i.e., P S> iV for models with N > Nq. 

• The bound on the held excursion is violated in models with multiple axions. 

Let us discuss these possibilities in detail. The hrst option may not be compatible with 
the WGC, for the following reason. In the T-dual picture of U(l) gauge helds [13], the 
WGC imposes not only a bound on the mass of electrically charged particles but also on 
magnetically charged ones, with magnetic charge 1/g [6]. This suggests that, for axions, 
both / —)■ oo and / —)■ 0 lead to the appearance of objects with a small action that 
modify the EFT. In the limit / —)■ 0, one would expect that, instead of an instanton, the 
action of a magnetic dual becomes small, such as the Euclidean strings discussed in [12]. 
From the point of view of string theory, one furthermore observes that letting / —)■ 0 can 
lead to a loss of perturbative control. In string compactihcations, the size of / is often 
related to the volume of certain cycles of the internal manifold (see, e.g., the single-axion 
examples discussed in [5]). If one makes / too large, these cycles become small such 
that large string corrections would invalidate the EFT description. On the other hand, if 
one makes / very small, the cycles become too large and KK modes become light, again 
leading to a break-down of the EFT. It is interesting to note, however, that the individual 
decay constants in certain type IIB compactihcations exhibit an iV-dependence at hxed 
cycle volumes such that they shrink if N is made large [27]. Hence, these models seem to 
evade the simple arguments made above and precisely realize the mechanism of option 
one. In view of the above discussion, it would be important to understand how general 
this scaling behavior is and whether/how it is ahected by the magnetic weak gravity 
conjecture. We defer a more thorough analysis of these open questions to future work 
and will assume in the following that there is no fundamental obstruction to constructing 
models with multiple axions in which the individual decay constants are slightly but not 
parametrically smaller than Mp. 

The second option does not seem to solve the problem either since general arguments 
involving black holes suggest that the renormalized Planck mass grows at most like a/W 
[42]. Moreover, the effect of renormalization is actually less severe when evaluated in con¬ 
crete string models [17] (at least for those corrections that are explicitly known). Hence, 
the renormalization factor is not large enough to stop the power-law and exponential 
enhancements observed in [17, 28]. 

Option three is hard to exclude in full generality. However, one may argue that, 
even if there was an upper bound on the number of axions allowed in string compacti¬ 
hcations, this would unlikely be the mechanism responsible for stopping the parametric 
growth of /eff. After all, it is well-known that, e.g., compactihcations on Galabi-Yau 
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3-folds can easily yield several hundred or even thousands of axions such that the hy¬ 
pothetical bound Nq would have to lie above that number. This would imply a huge 
enhancement several orders above the Planck mass. We should therefore expect that 
another mechanism is responsible for stopping the enhancement way before a bound on 
N could cut it off. A possible caveat is that moduli stabilization issues might lead to 
problems already for smaller N. Such problems were discussed in [39, 40] in the some¬ 
what different context of axion monodromy inflation but are not directly applicable to 
models without monodromies (see also [43, 44] for possible effects of heavy moduli on 
the inflationary potential). In any case, the main issue discussed in these works seems 
to be tuning rather than a fundamental obstruction. This suggests that the worst-case 
scenario is that inflationary models with many axions are rare in the string landscape 
but not forbidden. 

This leaves us with the possibility that additional (multi-)instanton terms become 
relevant in the scalar potential whenever the naive held range is too large. The appearance 
of unsuppressed multi-instanton corrections was indeed the reason why engineering super- 
Planckian decay constants failed in the single-axion models of [5], and it was recently 
also emphasized as the main obstacle in the context of multi-axion models [12, 13]. It is 
therefore natural to investigate the effect of such instanton corrections in a general model 
with multiple axions. In particular, it should be interesting to know how many dominant 
instanton corrections would actually have to appear in the scalar potential in order to 
forbid a parametric enhancement of the effective axion decay constant. 

A natural guess is that higher harmonics of D-brane instantons (i.e., multi-wrapped 
D-branes) may become unsuppressed at large N and thus bound the held range. However, 
in order for this mechanism to be ehective, multi-instantons with a huge charge would 
have to become relevant in the scalar potential. For example, in the simplest case of N- 
hation, cutting oh the diagonals with /eg ~ \fN would require the multi-instantons with 
charges (±1, ±1, ±1,..., ±1) under the N axions to have an action of order 1. Hence, 
the actions of the leading instantons (i.e., those with unit charge) would have to be tiny, 
Se ~ l/y/N. In a general model with power-law or exponential enhancement, the actions 
would have to be even smaller. Since the action of a D-brane scales with the volume of 
the wrapped cycle and inversely with the dilaton, this is expected to be the case only at 
extremely small volumes or at a huge string coupling, and thus we would be far away 
from the perturbative regime of string theory. In a perturbative string compactihcation 
with /« < Mp, on the other hand, the instanton actions should be much larger. Indeed, 
gravitational (multi-)instantons, which are conjectured to be the low-energy effective 
descriptions of D-brane (multi-)instantons [12], have been shown in this regime to be 
highly suppressed at large N such that they would not spoil enhanced field excursions in 
a potential generated by other instantons [12, 14]. 

It is in principle conceivable that string theory simply does not admit any perturba- 
tively controlled compactihcations with many axions. However, it is a priori not clear 
why this should be the case (cf. bullet point 3). Moreover, there exist proposed counter¬ 
examples in which higher harmonics of the known instanton effects are subdominant 
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and super-Planckian field excursions appear to be possible [17, 15] Assuming that 
the approximations made in these constructions are trustworthy, one may then wonder 
what could possibly happen in such a model in order to evade the conclusion that super- 
Planckian held excursions are allowed in string theory. If the WGC implies a bound on 
the held excursion, it would then predict additional non-perturbative ehects that would 
have to become dominant in the scalar potential. 

It is not the goal of this paper to perform explicit string theory computations to shed 
light on this important issue. Instead, we are interested in drawing model-independent 
conclusions from an analysis of the geometric properties of the fundamental domain of the 
axion moduli space, analogous to what we did in Section 3. Such a held theory analysis 
can obviously not decide whether the WGC holds in any of its forms. Proving or disprov¬ 
ing this would require input from quantum gravity (e.g., via an analysis of instantons 
in string theory), but a held theory computation may still be useful in determining the 
general conditions under which the held excursion can be bounded. Motivated by the 
above discussion, we will specihcally be interested in the question how many dominant 
instantons would have to be provided if the WGC indeed bounds the held excursion. 

In the next section, we will focus on precisely this situation, i.e., we assume a model 
with N axions where the individual decay constants are sub-Planckian but not para¬ 
metrically smaller than Mp. We then ask how many dominant instantons would have to 
contribute to the scalar potential in such a model to ensure that there is no parametrically 
enhanced diagonal anywhere in held space. We will hnd evidence that this number grows 
exponentially with N such that string theory would have to provide an enormous number 
of extra instanton corrections on top of the known non-perturbative ehects, which are of 
order N in typical string compactihcations in regimes of perturbative control. 

To summarize, it is not clear whether the alternatives discussed in this section suffice 
to explain a bound on the held range in string compactihcations with many axions. This 
may suggest the existence of a loophole such that super-Planckian held excursions are 
actually allowed in string theory when the number of axions is large. 

We should nevertheless stress that we have certainly not excluded any of the possi¬ 
bilities discussed in this section but merely pointed out several difficulties which are not 
explained by what is currently known in the literature. In order to understand each of 
these mechanisms in more detail, it is clearly important to analyze explicit string com¬ 
pactihcations and check whether some of the above discussed problems are evaded. Such 
an analysis is beyond the scope of the present work. 

The field excursion in the model of [17] is only marginally super-Planckian with A(j> « l.lSMp. It 
would be important to see whether similar models with larger field ranges can be constructed. It has 
furthermore been pointed out that the model might be subject to dangerous corrections potentially 
spoiling the large field excursion [13]. A possible worry about the model of [15] is that it is a string- 
inspired scenario rather than a fully explicit string theory computation. 
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4.2 Relation between instantons and enhancement 

As discussed above, let us now consider a model with N axions in which the individual 
decay constants are smaller but not parametrically smaller than Mp. We would now 
like to understand how many dominant instantons P have to contribute to the scalar 
potential in order that the effective axion decay constant is not parametrically enhanced 
with N}^ 

So far, we have only studied “local” properties of the iV-polytope bounding the axion 
moduli space without any input from string theory. In particular, we analyzed in Section 
3 how the enhancement in the direction of a particular vertex depends on the distances 
and dihedral angles of the N facets intersecting at that vertex. Now we would like to 
understand the “global” problem of how the enhancement is related to the total number of 
facets 2P of the polytope, which is directly related to the number of dominant instanton 
corrections in the scalar potential. 

What result should we expect? As a simple example, consider a model in which 
P = N and there is no alignment. The polytope is then an iV-cube with Pythagorean 
enhancement /eg ~ \/N. How many extra instanton terms would have to appear in the 
scalar potential in order to evade a parametric growth of the enhancement at large iV? 
A natural guess is that one would at least have to cap off each of the 2^ vertices of the 
cube by adding a further “diagonal” facet. Hence, if there is a bound on the axion field 
excursion, the true fundamental domain consistent with quantum gravity would have to 
be a polytope with P ~ 2^ instead of P = A^. This would imply that string theory must 
generate an exponentially large number of dominant instantons in such a model. In the 
following, we will argue that a similar conclusion actually holds for generic polytopes. 

To see this, we will proceed in two steps. First, we will combine known results in the 
math literature with some general considerations to relate the distribution of the angles 
a 2 ^'^ to P. We will then use our general recurrence relation to determine the convergence 
radius of Jn in terms of P, i.e., we will analyze how large P has to be chosen in order that 
/at converges to a hnite value at large N. We will not attempt to give an analytic proof 
of the convergence radius since this is extremely difficult without specifying a particular 
polytope. Instead, we will use our recurrence relation to test the enhancement for several 
classes of generic polytopes and see whether it converges. 

For simplicity, let us restrict to the case where the distances of all facets of the polytope 
from the origin are equal, i.e., = /i, and only variations in the dihedral angles are 

allowed. This is physically reasonable as we typically expect the to be of roughly 
the same size. Furthermore, we have shown that introducing variations in the distances 
increases the enhancement such that our results below should be considered as providing 
a lower bound on the nnmber of required instantons. 

Our computation does not depend on the physical origin of the instanton corrections such that it 
would also apply to a situation where higher harmonics provide some of the P dominant contributions 
to the scalar potential. However, as stated above, evidence in the literature suggests that these are 
highly suppressed for the known instantons in the regime of interest. 
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There are several results in the math literature which will be useful for our purpose. 
First, under the above assumption, we can map our problem of interest to a well-known 
statistics problem, namely the properties of randomly chosen points on the unit iV-sphere. 
Taking these points to be the endpoints of the 2P normal vectors dehning our polytope, 
their distribution is all we need to determine the dihedral angles and, via our recurrence 
relation, the enhancement of the effective axion decay constant. We choose a uniform 
distribution of points on the hypersphere, which should correspond to the least amount 
of alignment, again giving a lower estimate for the expected enhancement of a general 
polytope. It was proven in [45] that, for large P, the probability density for the angles 
between the normal vectors is then given by 

P(0) = ^A^(sin0)"^-^ (4.1) 

y^r(—) 

Using that the total number of angles between the 2P normal vectors is one can 

furthermore show [45] that 

©min arccos ©max —)■ TT — arCCOS — e“^^ (4.2) 


in probability, where ©min and ©max are the smallest and the largest angles between any 
two normal vectors in the polytope (i.e., those angles for which the probability drops to 
p( 2 P-i) )' polynomially growing P, this implies 


© 


min ^ 2 



©. 





(4.3) 


Hence, for polynomially growing P, all angles are likely to be very close to |, whereas, for 
exponentially growing P, the difference ©max — ©min converges to a hnite value and thus 
also allows smaller or larger angles. For large N and small | f ~ © |, one can furthermore 
show [45] that (4.1) converges to 


p(©) 




exp 
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(4.4) 


Hence, for polynomially growing P, the angles are normally distributed. 

For large N, the above results determine the distribution of the angles between any 
pair of normal vectors in the polytope. But what about the distribution of dihedral angles, 
i.e., the angles between those normal vectors whose facets intersect? If the number of 
facets is very small, they can effectively bend around the iV-sphere such that every facet 
can intersect with every other facet in the polytope. In general, however, this will not 
be the case. The number of dihedral angles is therefore in general smaller than the total 
number of angles, and we should expect this ratio to become smaller as P/N is increased. 
In an Wsimplex, for example, the number of facets is very small, 2P = iV -|- 1, and 
one can convince oneself that the number of dihedral angles equals the total number of 
angles (). In an Wcube, on the other hand, one has 2P = 2N. The total number of 
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Figure 8: Schematic plot of the angle distribution at large N for two polytopes with 
polynomially growing P (left and middle picture) and one with exponentially growing P 
(right picture). The deviation of the minimal and maximal angles from | goes to zero at 
large N unless P grows at least exponentially. The ratio of dihedral angles (shaded area) 
to the total number of angles typically gets smaller as P/N gets larger. 

angles is then (^1^), where N of them equal 180° and the rest equals 90°. The number 
of dihedral angles is thus (^^) — N. 

In general, the number of dihedral angles in a polytope heavily depends on its topology. 
It is therefore not possible to come up with a general formula unless we specify exactly 
the polytope we consider. Since the number of dihedral angles is given by the number of 
intersections of {N — l)-facets, it equals the number of (A^ —2)-facets, which we will denote 
by Fn -2 in the following. Hence, we can get a (conservative) estimate for the dihedral 
angles at a given vertex by drawing them from the Fn -2 smallest angles among the total 
(‘^2 ) angles in the polytope, which in turn are drawn from the normal distribution (4.4) 
(see Fig. 8). This implies that, for a given N^ the smaller the ratio is between Fn -2 and 
(^1")) smaller are the dihedral angles and, hence, the enhancement. 

One way to determine Fn -2 is to consider specific examples for which the face numbers 
are known and then study random polytopes under the assumption that their Fn -2 is 
the same. These could either be deformations of the known examples with the same 
topology or, more generally, other polytopes which happen to have the same Fn-2- Two 
simple classes of examples are the A^-cube with 2P = 2N and Fn -2 = 2N{N — 1) and 
the cross-polytope with 2P = 2^ and Fn -2 = \N2^. In addition, we consider two 
classes of polytopes with P ~ N"^, namely the so-called associahedra and cyclohedra 
(see, e.g., [46, 47] for their exact definitions and properties). One finds that cyclohedra 
have 2P = N{N + 1) and Fn -2 = |(-^ + 2)(A^ -|- 1)N{N — 1), while associahedra have 
2P = lN{N + 3) and Fn -2 = + 4)(Ar + ?,)N{N - 1)}^ 

A more general approach is to make use of a lower bound on Fn- 2 , which is available 
for a large class of polytopes called simple polytopes. These are polytopes whose vertices 

These numbers can be determined from so-called /-vectors or /i-vectors, which encode the topology of 
polytopes. An explicit expression for the /-vector of a cyclohedron can be found, e.g., in [48], and the 
h-vector of the polytope dual to an associahedron is given in [47]. 
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Figure 9: An estimate for the enhancement on typical vertices of polytopes with P ~ TV 
for different TV, where the codimension 2 face number equals that of an TV-cube (blue) or 
saturates the lower bound theorem (red). 
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Figure 10: An estimate for the enhancement on typical vertices of polytopes with P ~ N'^ 
for different N, where the codimension 2 face number equals that of an associahedron 
(blue) or a cyclohedron (violet) or saturates the lower bound theorem (red). 
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Figure 11: An estimate for the enhancement on typical vertices of a polytope with P ~ 2^ 
for different N, where the codimension 2 face number equals that of a cross-polytope. 


have degree N, i.e., N facets intersect at each vertex. In fact, this is not a strong 
restriction for our purpose since we are only interested in the enhancement. In particular, 
any polytope can be made into a simple polytope by truncating all vertices with degree 
higher than N (i.e., cutting these vertices off). If the truncation is done inhnitesimally 
far away from a vertex, the enhancement in this direction does not change. Hence, our 
computations should be valid for any kind of polytope. A useful theorem is then the 
so-called lower bound theorem, which was proven in [49, 50] and implies 


Fn-2 ^ 2NP — 


N{N+1) 

2 


( 4 . 5 ) 


for all simple polytopes. The downside of this method is that the above bound is not very 
strong in the sense that it often massively underestimates the number of dihedral angles 
(and, hence, the enhancement). Taking as an example the associahedra and cyclohedra 
mentioned above, the actual number of dihedral angles scales like while the lower- 
bound theorem only gives a scaling N^. 

Following the strategy outlined above, we have plotted the enhancement along a 
typical vertex of several classes of example polytopes for different N, where “typical” 
refers to the statistical approach explained above. We considered random deformations 
of the cube (P = N) and the cross-polytope (P ~ 2^) as well as two classes of random 
polytopes with P ~ N'^ whose codimension 1 and 2 face numbers (i.e., 2P and Fn- 2 ) 
were assumed to equal those of associahedra or cyclohedra (see Figs. 9-11). Note that we 
did not have to explicitly construct all these polytopes. Rather, we used our recurrence 
relation to compute the enhancement along a single vertex whose dihedral angles are 
determined using the algorithm described above. 

Not surprisingly, one observes that, for the case P = N, the enhancement grows 
parametrically at large N. Since almost all angles are dihedral angles for an A^-cube 
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at large N, it is likely that many of them are larger than As was shown in Section 
3, we then expect a wide range of possible enhancements, which is indeed reproduced 
by the upper plot in Fig. 9.^^ More interestingly, we can see from Fig. 10 that even a 
quadratic growth P ~ N'^ is not sufficient to stop the parametric enhancement. Although 
the large number of facets has the effect of making most of the dihedral angles smaller 
than |, this is apparently not enough to stop the enhancement. In Section 3, we argued 
that deviations || — ( 02 ) | ~ 0(1/N) can already be sufficient to move away from the 
Pythagorean regime to a regime of slower enhancement, but we did not specify a precise 
criterion for when the enhancement stops entirely. Fig. 10 shows that having as much 
as P ~ facets is not enough. On the other hand, we can see from Fig. 11 that, 
as expected, an exponential growth of P has the effect that the enhancement dies out 
completely. In fact, the number of facets then grows so fast that the enhancement even 
falls for large N. 

In addition to these examples, we made use of the lower bound theorem to also get 
an estimate of the enhancement in completely general simple polytopes (lower plots in 
Figs. 9 and 10). The plots show that the enhancement then grows rather slowly. As 
explained above, this is not surprising as (4.5) bounds the expected enhancement from 
below in a rather crude way. The interesting point to notice is, however, that, even under 
our very conservative assumptions, there is still no sign of convergence. This provides 
further evidence that not even a quadratic growth of P with respect to N is sufficient to 
stop a parametric enhancement of the axion held range. 

Intuitively, these results can be understood from the interplay of two effects. First, 
when P is made large while keeping N hxed, the ratio of the number of dihedral angles 
to the total number of angles tends to get smaller such that the values of the dihedral 
angles move to the left-hand side of the normal distribution (see Fig. 8). Hence, large 
P tends to make the enhancement smaller for a given N. Second, at large N, almost all 
angles gather around the value |, where the width of the distribution is inhnitesimally 
small unless P grows at least exponentially with N. Hence, for polynomially growing P, 
the dihedral angles cannot deviate much from | such that only an exponential growth of 
P is effective in bounding the enhancement. This is of course not a proof but it justihes 
the expectation that an enormous number of instanton corrections would be required in 
order to restrict the axion held range to sub-Planckian values. 

One might argue that the slow growth of the axion held range observed in the examples 
with P ~ N'^ is nothing to worry about. However, for sufficiently large N, this would still 
lead to a violation of any bound on the held excursion. We should also again emphasize 
that our estimate for the enhancement was very conservative and in many cases heavily 
underestimates the P necessary to stop the enhancement. As stated above, one reason 

In fact, several realizations showed an infinite amount of enhancement because the angles were so large 
that the different facets could not “close” into a convex object anymore. This is consistent with Section 
3, where we found that small deviations from ^ of order 0{1/N) can already make the enhancement 
diverge, and with [28], where it was argued that a moderate tuning of the anomaly coefficients can 
already lead to an exponentially large enhancement at large N. 
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is that we used the lower bound (4.5) on the number of dihedral angles for some of the 
plots, which is general but not very strong. Another reason is our assumption that the 
dihedral angles are always the smallest angles in the distribution of all angles, which 
need not be true in general. Furthermore, we computed the enhancement for a typical 
vertex, which we specihed by drawing (^) dihedral angles out of the set of all Fm-2 
dihedral angles of the polytopes in question. However, a vertex determined this way is 
not necessarily the vertex with the largest enhancement in a given polytope. Finally, 
we did not take into account additional enhancement due to variations in the distances 
Hence, the above results should be seen as a very conservative lower estimate of the 
enhancement expected in a general polytope. Even in this restrictive situation, we found 
that a quadratic relation between P and N is not sufficient. 

Another point is that, in string theory, one does not have the freedom to choose a 
polytope with the “optimal” relation between the number of facets and the enhancement 
produced. Hence, even if a class of polytopes existed at large N for which the enhance¬ 
ment already converges with, say, P ~ (which is still a huge number), it would not 
be guaranteed that the axion moduli space in actual string theory models would have 
a fundamental domain in the shape of such a polytope. A typical situation in a string 
theory based model is that a certain number of instanton corrections, say N, are already 
known to exist and be dominant. If these corrections are the leading instantons for the 
modes to which they couple and by themselves yield a fundamental domain in the shape 
of an A^-cube (which is not far-fetched), it is difficult to escape the conclusion that the 
number of additional instantons required to cut off the Pythagorean enhancement would 
have to grow exponentially (cf. the discussion at the beginning of this section). 

This is all not very surprising from the geometric point of view but it is interesting 
from the physics perspective. Let us emphasize here that P is not equivalent to the 
total number of (multi-)instanton corrections to the scalar potential but to the number 
of dominant instanton corrections (i.e., those instantons which lead to non-negligible 
contributions to the scalar potential and thus bound the held range). Of course, the 
total number of multi-instantons can be inhnite even in a model with only one axion 
but this is not what we claim here. Furthermore, each of the P instantons must couple 
to a different linear combination of the N axions in order to create an additional facet 
in the polytope bounding the fundamental domain. If one considers the alternatives 
discussed in Section 4.1 unreasonable, our result thus suggests two different scenarios: 
either models with multiple axions are capable of exploiting a loophole that allows them 
to have super-Planckian held excursions or they must involve an enormous number of 
dominant instanton terms in the scalar potential. If quantum gravity indeed bounds the 
axion held range to be sub-Planckian in models with many axions, one would then have 
to explain where all the dominant instanton corrections come from. 


5 Discussion 


In this paper, we studied the maximally allowed held excursion in models of inhation with 
multiple axions. We hrst showed that general bottom-up models admit a wide range of 
different regimes of enhancement, depending on the dihedral angles and distances of 
the facets in the polytope which bounds the fundamental domain of the axion moduli 
space. We also argued that the values of these parameters are related to the amount of 
alignment and the number of instanton terms in the scalar potential. We then asked the 
question which subset of these models is compatible with quantum gravity. We argued 
that models with a large number of axions must either involve an enormous number of 
dominant instanton terms in the scalar potential or be capable of super-Planckian held 
excursions by virtue of a loophole. Whether or not such a loophole exists has been heavily 
debated in the recent literature. The various arguments in favor of and against a strict 
bound on the axion held range are based on general quantum gravity considerations [6, 9- 
11, 13, 16] as well as on explicit studies of instanton solutions [12, 14] and string theory 
models [9, 11, 17, 14, 15]. 

In this paper, we took a diherent approach to the problem. Our aim was to draw 
general, model-independent conclusions from the hypothesis that the WGC forbids a 
parametric enhancement of the axion held range, based on a purely geometric reasoning. 
We found that, in order for the enhancement to converge to a hnite value at large N, the 
number of dominant instanton terms in the scalar potential would have to grow faster 
than quadratically, presumably even exponentially, with N. In the perturbative regime 
of string theory, this is in stark contrast to the usual assumption of a linear relation 
between the number of instanton terms and the number of axions. To our knowledge, 
explicit string compactihcations show no evidence that such an enormous number of 
dominant instanton corrections should be expected. For example, the model of [51], 
which was recently reconsidered in [17] in the context of axions, has = 51 and P = 60. 
A possible caveat is that works on magnetized instantons [52] show that the number of 
non-perturbative corrections to the scalar potential is often larger than expected from 
a “naive” counting of the rigid divisors. Furthermore, it was emphasized in [12] that 
also non-BPS instantons can contribute to the scalar potential in non-supersymmetric 
situations like inflation. Nevertheless, it is questionable whether these arguments could 
explain a number as huge as required by our hndings. In particular, if the WGC bounds 
the field excursion, this is expected to be true also in supersymmetric setups even though 
they are not suitable for cosmology. Furthermore, it was argued in [14] that the actions of 
the gravitational instantons of [12] generically scale with powers of N in large-iV models 
such that the corresponding corrections to the scalar potential tend to be suppressed. 

To summarize, we anticipate that our result can be interpreted in two different ways. 
If one insists that the held excursion should be strictly bounded in models consistent with 
quantum gravity, it implies that bottom-up models of large-held inhation with multiple 
axions lie in the swampland since their assumptions on the number of terms in the scalar 
potential are false. Consistent models would instead have a much larger amount of dom- 
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inant instanton corrections, which is only expected in non-perturbative regimes of string 
theory. On the other hand, assuming that large-iV models and alignment mechanisms 
can be embedded in perturbative regimes of string theory, one may argue that it is un¬ 
reasonable to expect a number of dominant instantons as huge as required by the above 
results. This suggests that a more reasonable interpretation of our results is to conclude 
that axions must be capable of violating bounds on the field excursion (imposed by the 
WGC or any other quantum gravity constraint), e.g., by exploiting one of the loopholes 
hypothesized in [10, 12, 13]. 

For future work, it would be interesting to further elaborate on the arguments put 
forward in this paper. In particular, it would be nice to get more precise results for the 
convergence radius of /es in terms of P. It might also be interesting to revisit some of 
the alternatives discussed in Section 4.1 in order to see whether axions can somehow 
evade our conclusions. Although we have given arguments for why we consider these 
alternatives unlikely, it would be useful to make these arguments watertight or else find 
possibilities to circumvent them. In view of our results, it would also be important to 
gain a better understanding of instanton solutions in general string compactifications 
with gravitational and gauge degrees of freedom (see also [16] for a discussion of this 
point) and to construct explicit string theory models to check our claims and those of 
[12, 13] in more detail. We hope to report on further progress on these open questions in 
future work. 
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A Enhancement for a general polytope 

In this section, we compute the enhancement in the direction of a vertex of a completely 
general polytope, where the N facets intersecting at the vertex can have different distances 
from the origin and different dihedral angles (with i, j = 1,..., Y and i < j). 
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A.l Trigonometry 

We first collect some trigonometric identities that will be useful below. The situation 
relevant for us is depicted in Fig. 12. Our goal is to compute the lengths of d and e 
depending on the lengths of d,b,c and on their mutual angles ip, ip, 9. All relations we 
will need below for the enhancement are then special cases of this situation. 

Let us choose our coordinate system such that 


( aA /a_Li\ ^ / 1 \ /cos'ipX ^ /cos A /eA 

02 , o± = a ±2 , ^ ^ 0 , c = c sin -i/ , d = d sin /3 , e = 62 . 

03 / W3J Vo/ Vo/ Vo/ \esj 

(A.l) 

From a ■ a = a?, a ■ b = ab cos p and a ■ c = ac cos 9, we hnd 


cos 9 — cos ^|J cos p 

ai = acosp, 02 = o-^-, 

sin-^ 

cos^ p + cos^ 9 — 2 cos tp cos p cos 9 
sin^ ip 



Furthermore, we have 

b o 

- = cos p, 
d 

which can be solved for d and (3, 


c 

d 


cos('0 — (3), 


d 


a/ 62 + — 26c cos -0 

sin-^ 


6 sin Ip 

p = arccos ^=. 

y 62 + c 2 — 26c COS'!/ 


(A. 2 ) 

(A.3) 


(A.4) 


(A.5) 


We now compute the angle 7 between a and the {z, d) plane. In order to hnd it, we 
dehne a vector d±, which lies in the (z, d) plane such that its angle with a is minimized. 
Demanding that d±_ lies in the (F, d) plane amounts to having 


F X dj_ = 




Hence, we can make the ansatz 


a± = a± 



(A. 6 ) 


(A.7) 


where a± is the length of the vector and A is a yet unknown number. For later convenience, 
we choose the length a± such that d± touches the intersection of the {z, d) plane with the 
plane orthogonal to a (see Fig. 12), 


a 

a± = -. 

cosy 


(A. 8 ) 


31 











The angle 7 between a and a± is determined by 

a ■ d± = aa± cosy = a± ^OiA cos 13 + a 2 X sin (3 + , 


which yields 


7 = arccos 


aiA cos /3 + a 2 \sin [3 + a^y/l — A^ 


Minimizing with respect to A, we hnd 


A = 


ai cos /3 + 02 sin (3 


a/( oi cos /d + 02 sin /3)2 + a| 
The angle 7 between a and d is given by 

d ■ d = ad cos 7] = aid cos [3 + 02 ^ sin (3, 

yielding 

Oi cos /3 + 02 sin [3 

7] = arccos-. 

a 

The angle ^ between d± and d is given by 

d± ■ d = a±dcos ^ = a±d\. 

Hence, 


(A.9) 

(A.IO) 

(A.ll) 

(A.12) 

(A.13) 

(A.14) 


.^ = arccos A. (A.15) 

The vector e also lies in the {z, d) plane. We only need its length, which is given by 

a/o 5 _ + — 2a±d cos ^ 


e = 


sin,^ 


(A.16) 


Finally, we will need to know the angle y between dj_ and a vector g, which reaches into 
a fourth dimension, 

/ gi\ 


9 = 


92 

93 

\9.J 


(A.17) 


and whose components are specihed in terms of its length g and its mutual angles with 
d, c, d, 

g^ = g^, d-g = ag cos 5, c-g = cgcos(, d-g = dgcosu. (A.18) 

Using these relations to compute the components of g, we can substitute the latter into 
■ g = ci.±g cos X to hnd 

i/l — A^ a cos 5 


X = arccos 


A cos u + 


as 


Vl — A^ [cosa;(ai sin'^ — 02 cost/j) — cosC(ai sin {3 — a2 cos ( 3 )] 
a3(cos'^ siu/S — cos/d sin'^) 


. (A.19) 
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Figure 12: Vectors and their mutual angles. 


A.2 Recurrence relation 


Let us now compute the enhancement of the effective axion decay constant for N = 2 and 
N = 3. Hence, we consider a vertex dehned by the intersection of three planes (see Fig. 
13). Let us denote the normal vectors to these planes by fl^\ fl^\ fl^\ their distances to 
the origin by and their dihedral angles by a^‘^\ Furthermore, 

/2 denotes the vector which is normal to the intersection of the two planes that are 
parallel to the z axis, and /a is the vector pointing to the vertex, with lengths /2 and /a, 
respectively. Finally, denotes the vector which is normal to the intersection of the 
{z,f 2 ) plane with the plane normal to fl^\ and da^^^^ is the angle between 
Our goal is to compute /2 and /a in terms of and 

By comparing Fig. 13 and Fig. 12, one notes that this computation is identical to 
the one in the previous section upon identifying 


and 


a = ff\ a± = flf b = ff\ c = fY\ d = f2, e = fy 


/ (12) (23) n (13) t ~(123) 

V = a 2 ) ^ = e = a\ \ 4 = «3 • 


Hence, for N = 2, we hnd 


(A.20) 

(A.21) 


/2 = 




COSttn 


sm a. 


( 12 ) 


(A.22) 


For N = 3, the expression is already rather lengthy. 


fs — 


/f' \ , f2 o ’ f ~ (123) 

211— + to — 2^11— fy COS a\ ’ 

cos 7 / ' 2 COS O 


COSfy *• 


sina 


(123) 


(A.23) 
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N = 2 N = 3 N = 4 

Figure 13: Normal vectors and the angles between them in a general iV-polytope for 
N = 2, N = 3 and = 4. 


with 


~(123) 

CTg = arccos 


ai cos (3 + a 2 sin (3 


(ai cos (3 + a 2 sin (3)“^ + a| 

(13) (12) (23) 

n = f(3)po^^(23) ^ _ ( 3 ) COSa^ ^ - COS ^ COS 

CLi COS 0^2 ; (I 2 Jl 0 91 ’ 


( 12 ) 
sm 02 


„ _ f(3) 

03 — Jl 

(3 = arccos 




2 (23) , 2 (13) o (12) (23) (13) 

cos^ aX + cos^ ttn — / cos an cos an cos an 


• 2 (12) 
sm a 2 


l'(2) • C 

Ji sm ^2 


( 12 ) 


7 = arccos 


V(/tF+(/fy-2/7/hcosa7>’ 

(ai cos /3 + a 2 sin 


n 


(3) 


(A.24) 

(A.25) 

(A.26) 

(A.27) 

(A.28) 


For general N, the enhancement can be obtained iteratively by repeating the above 
computation for successive 3-dimensional subspaces of the full A^-dimensional moduli 
space. This is illustrated in Fig. 13. Analogous to the notation used above, we now 
denote normal vectors to {N — ?T,)-facets by /„ and their lengths by /„. Furthermore, 
we define a vector in each 3-dimensional subspace by demanding that it lies in a 
plane orthogonal to /J" and fn -2 for each n > 4 (see Fig. 13). We also have to 
compute various angles between the different normal vectors, for which we choose the 
notation ai'"^ = Z(/i,^_i) and dn"^ = fn-i)- To avoid ambiguities, we indicate 

the subspace in which the angles are computed in superscript, e.g., denotes an angle 
between normal vectors that lie in the 124 directions. 
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Each iteration n = 4,..., iV then involves three steps. First, comparing Fig. 13 and 
Fig. 12, we can identify 


and 


a = ft\ =/!?, b = fll^\ c = fn-2, d = fn-u e = fn 


I ~(123...n—1) ~(n—l,n) q (123...n—2,n) * ~(123...n) 


(A.29) 


(A.30) 


This gives us /„ in terms of fn-i and our input parameters Second, in order 

to obtain the angles which will be required in the next step, we have to 

consider 

a = /J™\ a± = jf^\ b = c = fn-2 (A.31) 

and 


(A.32) 


for m = n + 1,..., A^. Finally, in order to obtain the angles required for the next 

iteration, we have to consider, again for m = n + 1 ,..., A^, 


9 = f\^\ 


UJ = al^23...n-l,m) 


c = 


(123...n—2,m) 


f- (n,m) 

b = a\ \ 


~ (n,m) 

X = «2 4 


(A.33) 


where a, 6 , etc. and if, (p, etc. are dehned as in (A.29) and (A.30). 

Using this iteration rule together with the expressions computed in the previous sec¬ 
tion, we obtain a recurrence relation for /„ in terms of fn-i- It is then straightforward to 
automatize the iteration steps using computer algebra and determine the enhancement 
/v/(/i) for large N. The special case discussed in the main text is recovered by setting 
= /i, = a 2 - This considerably simplihes the recurrence relation since it implies 

d± = a and, hence, 7 = 0 by symmetry. 
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